Speech coding using trajectory compression and multiple sensors

نویسندگان

  • Sorin Dusan
  • James L. Flanagan
  • Amod Karve
  • Mridul Balaraman
چکیده

This paper presents a new method of multi-frame speech coding based upon polynomial approximation of speech feature trajectories incorporating multiple sensor signals from microphones, accelerometer, electro-glottograph, and microradar. The trajectory polynomial approximation exploits the inter-frame information redundancy encountered in natural speech. The trajectory method is applicable to features such as spectral parameters, gain, and pitch. The method is suitable for application to a frame vocoder to further reduce the transmission bit rate. Multiple transducers increase the intelligibility and quality of the coded speech in noisy environments. Experimental results are obtained by embedding the new method into an enhanced mixed-excitation linear prediction vocoder. The resulting vocoder operates at 1533 bps and preliminary intelligibility and quality tests show results comparable to those of the original 2400 bps vocoder.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real-Time Multiple-Description Coding of Speech Signals

When sending speech data over lossy networks like the internet, multiple-description (MD) coding is a means to improve the perceived quality by dividing the data into multiple descriptions which are then sent as separate packets. In doing so the speech signal can still be decoded even if only parts of these descriptions are received. The present paper describes the structure of a software which...

متن کامل

Comparing several models for perceptual long-term modeling of amplitude and phase trajectories of sinusoidal speech

The so-called Long-Term (LT) modeling of sinusoidal parameters, proposed in previous papers, consists in modeling the entire time-trajectory of amplitude and phase parameters over large sections of voiced speech, differing from usual ShortTerm models, which are defined on a frame-by-frame basis. In the present paper, we focus on a specific novel contribution to this general framework: the compa...

متن کامل

Digital Audio: from Lossless to Transparent Coding

We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially more compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple codings can reveal origi...

متن کامل

Lossless and Perceptual Coding of Digital Audio

We have seen rapid progress in high-quality compression of wideband audio signals. Today’s coding algorithms can achieve substantially better compression than was thought possible only a few years ago. In the case of audio coding with its bandwidth of 20 kHz and more, the concept of perceptual coding has paved the way for significant bit rate reductions. However, multiple coding can reveal orig...

متن کامل

Speech compression a novel method pdf

Text summarization is a process that reduces the size of the text document. Purpose, we use part of speech tagging to recognize types of the text words. speech compression applications Compression rate is a scale to decrease the size of text summary. speech compression abstract A higher.This paper illustrates a novel method of speech compression and transmission. This method saves the transmiss...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004